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(54) Data hiding method and data extracting method 



(57) I n order to hide message information (m) in me- 
dia information (M), the frequency transform of the mes- 
sage information (m) and the media information (M) are 
performed, and frequency spectra f., and f 2 are ob- 
tained. Next, from the frequency spectrum f 2 of the mes- 
sage information (m), a region containing feature fre- 
quency components representative of the features of 
the message information (m) in real space is extracted 



as the base region B. Then, n copies of the base region 
B are generated, and in frequency space, the n copies 
are dispersedly arranged. Also, by making the frequen- 
cy components other than that equal to zero, a frequen- 
cy spectrum f 3 is generated as intermediate information. 
Finally, the frequency spectrum f 3 and the frequency 
spectrum f t are added, and by performing the inverse 
frequency transform of the result, the message informa- 
tion (m) can be hidden in the media information (M). 
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Description 

The present invention relates to a data hiding meth- 
od for hiding message data into media data and a data 
extracting method for extracting hidden data. 

With the development of multimedia society, large 
quantities of digital video and audio information have 
been circulated on internet systems or as CD-ROM soft- 
ware. For digital video and audio information, any indi- 
vidual can easily create a perfect copy without degrada- 
tion, so the illegal use and copyright protection are be- 
coming problematic. In order to prevent a third party 
from illegally copying media data such as video and au- 
dio data, the technique of hiding additional information, 
such as the signature of an author, into original media 
data is becoming the focus of attention. When digital vid- 
eo data or other similar data is illegally copied, it could 
be known whether or not the copy is an illegal one by 
confirming the signature hidden in the copy and speci- 
fying the source. A hiding technique such as this is 
called data hiding. 

As for the data hiding technique, a technique such 
as mentioned as follows is disclosed in "Nikkei Electron- 
ics 4-22 1996," issued on April 22, 1996. Figure 1 is a 
block diagram for explaining a conventional data hiding 
and extracting method. In the process of hiding an iden- 
tification (ID) information, first, the frequency transform 
of media information, such as original dynamic images, 
still images, photographs, or audio, is performed by dis- 
crete cosine transform (DCT) or high-speed discrete 
Fourier transform (DFT) to obtain its frequency spec- 
trum. Spectrum diffused and convoluted to this frequen- 
cy spectrum of the media informal bn is ID information. 
The ID information is a random specified to each pur- 
chaser of a work. The random- number generating algo- 
rithm employs a normal distribution, and the length is 
1000. Then, the frequency spectrum of the media infor- 
mation to which the ID information was added is re- 
stored by inverse frequency transform tothe original dig- 
ital work having the hidden ID information. The digital 
work has the hidden ID information but is almost identi- 
cal with the original work, so the purchaser cannot per- 
ceive the difference visually. 

Conversely, in the process of extracting the ID in- 
formation, first, the frequency transform of the digital 
work, thought of as an illegal copy, is performed to obtain 
its frequency spectrum and the frequency spectrum of 
the original work is also obtained. Then, the difference 
between these spectra is taken and compared with the 
ID information issued by the author. With this compari- 
son, the purchaser of the work can be specified, so it 
can be judged whether or not the copy is illegal. 

One of the features of the aforementioned tech- 
nique is that frequency space is utilised for hiding data. 
That is, media information is transformed from real 
space to frequency space, and the frequency compo- 
nent in the frequency spectrum is operated based on ID 
information. Another feature is that a frequency region 



to be utilised is local. That is, the high-frequency com- 
ponent in the frequency spectrum of media information 
is not utilised and spectrum diffusion is performed only 
for a low-frequency region. The reason for this is that if 
ID information is diffused up to a high-frequency region, 
ID information is erased in compressing or decompress- 
ing the image. That is, this conventional technique spec- 
trum-diffuses ID information locally in the frequency 
space of media information and hides the ID information 
in the entire real space of the media information. 

The aforementioned conventional technique, how- 
ever, has the disadvantage that message information 
(ID information) can be easily filtered out from media in- 
formation by employing a frequency filter. For example, 
when media information with message information hid- 
den therein is processed by employing a high-pass filter 
which filters out only a low-frequency component, the 
component of the high-frequency band remains unal- 
tered, but the component of the low-frequency band in 
which message information has been hidden is com- 
pletely filtered out. Therefore, it is no longer possible to 
extract ID information from the media information that 
was output from a high-pass filter, and it is possible to 
hide different additional ID information. 

Accordingly, it is the objective of the present inven- 
tion to provide a data hiding method which has high re- 
sistance to removal or change of media information em- 
bedded into message information. Particularly, in the 
present invention it becomes possible to continue to ef- 
fectively maintain hidden message information even 
when signal processing is performed by employing a fre- 
quency filter. 

To solve the aforementioned problems, a first inven- 
tion provides a data hiding method which embeds mes- 
sage information into media information. The data hid- 
ing method comprises the steps of: 

(a) obtaining a frequency spectrum which contains 
a plurality of frequency components with respect to 
the message information; 

(b) extracting a base region from the frequency 
spectrum of the message information, the base re- 
gion being defined as a region which contains fea- 
ture frequency components that represent features 
of the message information in real space; 

(c) generating a plurality of copies of the base re- 
gion and generating a frequency spectrum obtained 
by dispersedly arranging the respective copies in 
frequency space, as intermediate information; and 

(d) hiding the message information into the media 
information by operating the frequency spectrum of 
the message information, based on the intermedi- 
ate information. 

Also, a second invention provides a data hiding 
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method which hides message information into media in- 
formation. The data hiding method comprises the steps 
of: 

(a) obtaining a frequency spectrum which contains s 
a plurality of frequency components with respect to 
the message information; 

(b) extracting a base region from the frequency 
spectrum of the message information, the base re- io 
gion being defined as a region which contains fea- 
ture frequency components that represent features 

ol the message information in real space; 

(c) generating a plurality of copies of the base re- is 
gion and generating a frequency spectrum obtained 

by dispersedly arranging the respective copies in 
frequency space, as intermediate information; and 

(d) hiding the message information into the media 20 
information by executing an arithmetic based on 
both the intermediate information and the media in- 
formation. 

Here, it is preferable that the base region contain a 25 
plurality of frequency components which represent ex- 
ternal features of the message information in real space. 
Particularly when the media information is image infor- 
mation, it is preferable that the base regions be consti- 
tuted by a plurality of low-frequency components which 30 
represent contour features of an image in real space. 

Also, in the step (b), a plurality of base regions may 
be extracted. In this case, for each of the base regions. 

The aforementioned copies each may have the 
same frequency component as the feature frequency 35 
component of the base region. Also, it is preferable that 
among the frequency components of the frequency 
spectrum as the intermediate information, frequency 
components other than positions at which the copies are 
arranged are zero. *o 

Furthermore, it is preferable that the arithmetic in 
the step (d) be a binomial arithmetic more specifically 
addition. Note that a step of performing inverse frequen- 
cy transform of a result obtained in the step (d) is exe- 
cuted when the arithmetic in the step (d) is executed in *s 
frequency space. 

A third invention provides a data extraction method 
which extracts message information from media infor- 
mation with the message information hidden multiply in 
frequency space. The data extraction method compris- so 
es the steps of: 

(a) obtaining, as first intermediate information, a fre- 
quency spectrum which has a plurality of base re- 
gions by operating the media information with the ss 
message information hidden therein, each of the 
base regions containing feature frequency compo- 
nents which represent features of the message in- 



formation in real space; 

(b) specifying at least one of the base regions from 
the frequency spectrum obtained as the first inter- 
mediate information; 

(c) generating a frequency spectrum obtained by ar- 
ranging the feature frequency components of the 
base region at predetermined positions on frequen- 
cy space, as second intermediate information; and 

(d) extracting the message information hidden into 
the media information by performing inverse fre- 
quency transform of the second intermediate infor- 
mation. 

A fourth invention provides a data extraction meth- 
od which extracts message information from media in- 
formation with the message information hidden multiply 
in frequency space. The data extraction method com- 
prises the steps of: 

(a) obtaining a frequency spectrum which has a plu- 
rality of frequency components with respect to first 
intermediate information obtained by an arithmetic 
based on both the media information with the mes- 
sage information hidden therein and the media in- 
formation with no message information hidden 
therein; 

(b) specifying at least one of the base regions from 
the frequency spectrum of the first intermediate in- 
formation each of the base regions containing fea- 
ture frequency components which represent fea- 
tures of said message information in real space; 

(c) generating a frequency spectrum obtained by ar- 
ranging the feature frequency components of the 
base region at predetermined positions on frequen- 
cy space, as second intermediate information; and 

(d) extracting the message information hidden into 
the media information by performing inverse fre- 
quency transform of the second intermediate infor- 
mation. 

Here, it is preferable that the arithmetic in the afore- 
mentioned step (a) be a binomial arithmetic. More spe- 
cifically, it is preferable that the arithmetic in the step (a) 
be the difference between the media information with 
the message information hidden therein and the media 
information with no message information hidden therein 

Also, on the frequency spectrum of the first interme- 
diate information, the base regions are dispersed and 
multiply arranged. 

In the aforementioned step (b), the base region may 
be specified by giving positional information which spec- 
ifies a position of the base region in frequency space. 
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Also, in the aforementioned step (c), the base re- 
gion may be arranged by giving an arranging rule which 
specifies a position of the base region in frequency 
space. 

Furthermore, it is preferable that among the fre- 
quency components of the frequency spectrum as the 
second intermediate information, frequency compo- 
nents other than positions at which the base regions are 
arranged are zero. 

In such a structure, a plurality of copies of the base 
region are dispersedly arranged in frequency space. 
That is, the base region is copied multiply in various fre- 
quency bands. Therefore, even in the case where the 
frequency component in a certain frequency band is fil- 
tered out by a frequency filer and where the copy ar- 
ranged on that portion is lost, the contents of the base 
region can be specified from the copies existing in other 
bands. The respective copies have the same frequency 
component as the feature frequency component of the 
base region. 

Therefore, if one copy can be specified, all feature 
frequency components can be extracted. Since the fea- 
ture frequency component represents the feature of the 
message information in real space, the message infor- 
mation can be reconstructed by extraction of the feature 
frequency component. 

How the invention may be carried out will now be 
described by way of example only and with reference to 
accompanying drawings in which: 

Figure 1 is a block diagram for explaining data hid- 
ing and extraction in background art; 

Figure 2 is a flowchart of a procedure of embedding 
data; 

Figure 3 is a schematic diagram for explaining data 
embedding; 

Figure 4 is a schematic view for explaining the state 
when base regions are arrayed in multiple in fre- 
quency space; 

Figure 5 is a flowchart showing a procedure of ex- 
tracting message information; 

Figure 6 is a schematic diagram for explaining data 
extraction; 

Figure 7 is a diagram showing the halftoned image 
of digital message information displayed on a dis- 
play device; 

Figure 8 is a conceptual diagram for explaining the 
embedding of data in the case where a plurality of 
base regions have been set; 

Figure 9 is a conceptual diagram for explaining the 



embedding of another data; 

Figure 10 is a block diagram of a system which em- 
beds message information into media information; 
s and 

Figure 11 is a block diagram of a system which ex- 
tracts message information from the media informa- 
tion with the message information hidden multiply 
io in frequency space. 

A data hiding method for hiding message informa- 
tion into media information will hereinafter be described 
based on Figures 2 and 3. Figure 2 is a flowchart of the 
is data embedding procedure in an embodiment of the 
present invention, and Figure 3 is a schematic diagram 
for explaining data embedding. 

Frequency Transform of Media Information M (step 21 ) : 

20 

First, media information (M) and message informa- 
tion (m) are prepared. The media information (M) is an 
object, in which data is hidden, and such information is 
either still image data, dynamic image data, voice data, 
25 or, etc. In the following description, still images such as 
those shown in Figure 3 are described as examples of 
a media image. 

Also, the message information (m) is information 
which is hidden into media information (M), and in the 
30 following description, a binary image "IBM" consisting 
of relatively low frequency components, such as the one 
shown in Figure 7(a), will be described as an example. 
The frequency transform of a still image as media infor- 
mation (M) is performed. The frequency transform can 
35 employ various known frequency transforms such as 
sine transform, Fourier transform, discrete Fourier 
transform (DFT), and wavelet transform. By employing 
frequency transforms suitable for the features of mes- 
sage information, conversion efficiency and reliability 
40 can be enhanced. By performing the frequency trans- 
form of media information (M), frequency spectrum f, is 
obtained. The frequency spectrum f 1 contains a great 
number of frequency components (a) in the form of a 
matrix, as shown in Figure 3, and the horizontal direction 
45 and vertical direction represent a one-dimensional di- 
rection and a two-dimensional direction, respectively. 
Note that the one-dimensional direction represents a 
higher frequency component as it goes right and the 
two-dimensional direction represents a higher f requen- 
50 cy component as it goes down. 

Frequency Transform of Message Information (m) (step 
22): 

55 By performing the frequency transform of message 
information (m) used as a binary image, there is ob- 
tained frequency spectrum f 2 where a great number of 
frequency components (b) are arrayed in the form of a 
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matrix. 

Extraction of Base Region B (step 23) : 

Base region B is extracted from the frequency spec- 
trum f2 of message information (m). This base region B 
is constituted by a plurality of feature frequencies which 
represent the external feature of message information 
(m) in real space. In a general image, a frequency com- 
ponent expressing the contour feature is often part of 
the entire frequency space. For example, a large and 
simple figure pattern, which can be easily perceived by 
humans, can be expressed only by relatively low-fre- 
quency components. 

Figure 7 depicts the halftoned image of digital mes- 
sage information displayed on a display device. The fre- 
quency spectrum of a binary image of Figure 7(a), con- 
stituted by 384 x 256 pixels, is obtained and a specific 
low-frequency band is made a base region. Here, as an 
example, each frequency component expressed by bij 
(where 1_L 50 (one-dimensional direction) and 1_j_50 
(two-dimensional direction)) is specified as a base re- 
gion. Then, the frequency components that are not con- 
tained in the base region are all made zero. Next, a new 
frequency spectrum obtained by this operation is re- 
turned to real space again, and consequently, an image 
such as that shown in Figure 7(b) is obtained. Although 
the picture quality has been degraded due to the dull 
edge of the image and fringe patterns, the picture quality 
has been maintained to the extent that requisite infor- 
mation can be extracted. From this resuft it follows that 
the contour feature of a simple image can be roughly 
expressed only by the low-frequency components within 
the aforementioned base region. Therefore, each fre- 
quency component within this base region is referred to 
as a feature frequency component. 

The base region can be determined automatically 
by referring to the value of each frequency component 
of a frequency spectrum. In general, it can be said that 
a frequency component with a larger value is an impor- 
tant component to express the image. Therefore, by pre- 
setting a threshold value which gives a criteria for judg- 
ing the importance of a frequency component, the fre- 
quency components of a frequency spectrum greater 
than threshold value are all extracted. Then, a place at 
which extracted frequency components are particularly 
concentrated (a place forming a region greater than a 
predetermined standard) is specified as a base region. 

Figure 4 is a schematic view for explaining the state 
when base regions are multiply arrayed in frequency 
space. In the frequency spectrum f 2 shown in the figure, 
the frequency component in the upper right region indi- 
cated by oblique lines is the feature frequency compo- 
nent. Hence, a block containing a low-frequency com- 
ponent representative of a binary image feature is cut 
out as the base region. 

It is noted that the feature frequency component 
does not always have to be a low-frequency component. 



The reason is that, in an image with a complicated ex- 
ternal feature such as a lattice pattern, a higher-frequen- 
cy component becomes of importance and therefore the 
feature frequency component can be different depend- 
5 ing upon images. Therefore, the base region in the 
present invention is defined as a region which contains 
a plurality of feature frequency components represent- 
ative of external features of message information (m) in 
real space. Thus, the feature frequency component is 
10 not limited only to a region containing a low-frequency 
component. 

Generation of Intermediate Information (step 24): 

The basis reference region B obtained in step 23 is 
repeatedly copied n times in frequency space to gener- 
ate intermediate information (frequency spectrum f 3 ). 
That is, n copies of the base region are generated and 
the respective copies are dispersedly arranged in fre- 
quency space. 

In the frequency spectrum f 3 shown in Figure 4, n 
copies of the base region B extracted in step 23 are dis- 
persedly arranged. Each frequency component of the 
arranged respective copies has the same value as a fea- 
ture frequency component corresponding to the position 
of the base region. For example, each frequency com- 
ponent of the uppermost row in a certain position at 
which the copy in the frequency spectrum f 3 is arranged 
is caused to correspond to the frequency component of 
the uppermost row in the base region, so the frequency 
component becomes M b11, b12, b1m" from left. This 
operation is executed for all positions at which the cop- 
ies of the base region B are arranged. Then, the fre- 
quency components other than the positions at which 
copies are arranged (the parts in the frequency spec- 
trum f 3 of Figure 4 indicated by oblique lines) are made 
equal to zero. 

In step 23, positional information on where the po- 
sition of the aforementioned n copes are arranged in fre- 
quency space is stored as a key K. This key K becomes 
necessary when message information is extracted. Al- 
so, information on the size of a base region is stored as 
the occasion may demand. The arranged positions may 
always be fixed positions regardless of an object to be 
embedded, but it is preferable to make the arranged po- 
sitions different for each object from the viewpoint of a 
countermeasure to change. 

In the case where the arranged positions are 
changed according to circumstances, a position series 
generation algorithm, such as that described for exam- 
ple in Japanese Patent Application No.8- 159330, can 
be employed. When the algorithm is employed, the po- 
sitions are arranged so that they are put aside to a spe- 
cific band within frequency band, in view of resistance 
to frequency filters. 
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Addition of Frequency Spectra (step 25): 

The frequency spectrum f 3 of an intermediate infor- 
mation and the frequency spectrum f 1 of media informa- 
tion (M) are added. s 

Inverse Frequency Transform (step 26) : 

The inverse frequency transform of the frequency 
spectrum, obtained by the addition in step 25, is per- io 
formed to obtain media information (M') in which mes- 
sage information was hidden. Because this media infor- 
mation (M l ) has been given a process based on mes- 
sage information (m), it is different from the original me- 
dia information (M) in the strict sense of the word. How- is 
ever, the difference is nearly unperceivable. 

The addition of frequency spectrum f3 and media 
information can be performed in real space. !n this case, 
it is desirable to multiply data to be embedded by a con- 
stant so that the inverse frequency transform of the fre- so 
quency spectrum f 3 is performed to maximally suppress 
degradation of picture quality which is caused due to ad- 
dition, and it is desirable to shorten the value. 

Note that in the case where addition is performed 
for each pixel in real space in this way, it is a matter of 25 
course that step 26 does not need to be executed. 

Extraction of Data 

For a method of extracting message information 30 
from media information (fvV) with the message informa- 
tion multiply hidden in frequency space by the afore- 
mentioned hiding method, a description will next be de- 
scribed based on Figures 5 and 6. Figure 5 is a flowchart 
showing the procedure of extracting message informa- 35 
tion, and Figure 6 is a schematic diagram for explaining 
extraction of data. Basically, the following data are need- 
ed in order to extract message information. 

(1) Media information (M*) in which message data 40 
was hidden 

(2) Original media information (M) 

(3) Key K as positional information 

Generation of Differential Information (I) (First 45 
Intermediate Information) (step 51) : 

By taking in real space the difference between the 
media information (M*) in which message information 
(m) was hidden and the original media information (M), so 
differential information is obtained as first intermediate 
information. 

Frequency Transform of Differential Information (I) (step 
52): 55 

By performing the frequency transform of the differ- 
ential information (1), the frequency spectrum f 4 is ob- 



tained. Note that the difference in step 51 may be cal- 
culated not only in real space but also in frequency 
space. 

In this case, the frequency spectra of the media in- 
formation (M 1 ) and the original media information (M) are 
first obtained. Then, the difference between these is tak- 
en in frequency space to obtain the frequency spectrum 
f 4 . In the frequency spectrum f 4 obtained in this way, the 
base regions are dispersed and multiply arranged. 

Extraction of Base Region B (step 53) : 

At least one base reg ion B is extracted from the fre- 
quency spectrum f 4 . In order to specify the position of 
the base region B in frequency space, the key K is em- 
ployed, and at least one region of the same size as the 
base region is taken out. On the premise that the base 
region was taken out, there is the need to inform an ex- 
tractor of the size of the base region, for example, by 
containing it in the key information. If media information 
(M') is in an ideal state having no damages, the frequen- 
cy spectrum f 4 is identical with the aforementioned fre- 
quency spectrum f3. Therefore, since n base regions 
have been arranged in multiple, a maximum of n base 
regions can be taken out in the ideal state. However, 
when the media information (IvV) is damaged by a fre- 
quency filter or an analogue copy, some of the base re- 
gions are lost. Therefore, there are cases where the n 
base regions cannot be all taken out. However, it is not- 
ed that, even in this case, if any one of the base regions 
can be extracted, the message information can be ef- 
fectively extracted. 

Generation of Second Intermediate Information (D (step 
54}: 

By arranging the base region B at a predetermined 
position on the frequency space, frequency spectrum f 5 
is generated as second intermediate information (I'). 

By arranging the base region B at a predetermined 
position on the frequency space, frequency spectrum f 5 
is generated as second intermediate information (I"). An 
arrangement rule about where the position of the base 
region B is arranged in frequency space has previously 
been given, and the base region B is arranged according 
to this rule. In this embodiment, it is assumed that the 
base region B is arranged at the upper left portion which 
is the low-frequency region of the frequency spectrum 
f 5 (see Figure 6). The value of each frequency compo- 
nent in the frequency spectrum f 5 is the value of the fea- 
ture frequency component of the basic frequency region 
B for the portions where the base regions B were ar- 
ranged, and the frequency components other than that 
are made zero. 

Inverse Frequency Transform (step 55): 

The inverse frequency transform of the frequency 
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spectrum f 5 is performed. With this, the message infor- 
mation (m) hidden into the media information (M') is ex- 
tracted. 

Thus, in the aforementioned method, since the 
base regions representing the feature of the message s 
information are dispersedly arranged in multiple in the 
frequency space of the media information, the embed- 
ded message information has high resistance to 
processing based on a frequency fitter and to analogue 
copying. The frequency filter is operated locally with re- io 
spect to the frequency space, so there is a low possibility 
that all of the aforementioned copies are destroyed. 
Therefore, if any of the copies is present without being 
damaged, the base region can be specified and there- 
fore the message information can be interpreted. is 

An embedded message information can not be tak- 
en out if the original media information (M) and the key 
K are not present. Therefore, the original media infor- 
mation and the key information fulfil the role of a key for 
cryptograph ing, so the forgery or change of data is very so 
difficult. 

Furthermore, the amount of data required for taking 
out embedded data is reduced and of practical use. The 
data required for taking out are original media informa- 
tion (M) and a key K for specifying the copied position 25 
of the base region. 

While the aforementioned embodiment has been 
described with reference to the case of a single base 
region, the present invention is applicable to even the 
case of a plurality of base regions. Figure 8 is a concep- 30 
tual diagram for explaining embedding of data in the 
case where a plurality of base regions have been set. 
By extracting 7 base regions (B, through By) from the 
frequency spectrum f 2 of message information and mak- 
ing multiple copies of all base regions, frequency spec- 35 
trum f 3 is generated as intermediate information. Note 
that Figure 8 depicts the case of a multiple degree 2. 

Also, although the aforementioned embodiment 
has been described with reference to the case where 
addition and subtraction are used for the embedding 40 
and extraction of data, the present invention is not lim- 
ited to addition and subtraction. That is, the present in- 
vention can employ any algorithm so long as original da- 
ta can be reconstructed from the result of an algorithm 
performed based on certain data when data is hidden. 45 
Therefore, various algorithms, including a reversible bi- 
nomial algorithm, are considered. 

The algorithm used here is a conception including 
replacement of frequency components. Figure 9 is a 
conceptual diagram for explaining the embedding of da- so 
ta in another embodiment. The frequency transform of 
media information (M) is performed to obtain frequency 
spectrum f, and, at the same time, base region B is cop- 
ied multiply in frequency space to obtain frequency 
spectrum f 3 as intermediate information. The procedure ss 
up to here is the same as the description of Figure 3. 
Next, as an algorithm, among the frequency compo- 
nents (a) of the frequency system f t corresponding to 



the positions of the base regions, only the frequency 
components corresponding to the positions of the base 
regions B are replaced. 

That is, the respective frequency components (a) 
corresponding to the positions of the regions (i.e., cop- 
ies of base regions) indicated by oblique lines in the fre- 
quency spectrum f 3 of Figure 9 are replaced with corre- 
sponding feature frequency components (b). 

In the aforementioned way, all frequency compo- 
nents (a) in the regions corresponding to the arranged 
positions of copies in frequency spectrum f, are re- 
placed with the feature frequency components (b) of the 
base regions B. The information on the positions at 
which copies are arranged is stored as a key K. By per- 
forming the inverse frequency transform of the frequen- 
cy spectrum given such an arithmetic of replacement, 
media information (M') embedded with message infor- 
mation is obtainable. 

On the other hand, in the case where message in- 
formation is extracted, the positions at which copies are 
arranged in the frequency space of the media informa- 
tion (M') are specified based on the stored key K. There- 
after, by executing a similar procedure as the aforemen- 
tioned method, the message information can be extract- 
ed. 

In the embodiment of Figure 9, the feature frequen- 
cy components of the basic regions B are directly the 
frequency components of the media information, so 
there is no need to employ the original media informa- 
tion (M) when message information is extracted. That 
is, message information can be extracted only by media 
information (M') and key K. Therefore, there is the ad- 
vantage that the amount of data to be stored can even 
be further reduced. 

In the embodiment of Figure 9, in order to suppress 
degradation of picture quality of media information it is 
desirable to make copies at the positions of the frequen- 
cy spectrum f., which are unlikely to have too-great of 
an influence on picture quality. Although the processing 
of an image depends upon the feature of message in- 
formation, it is undesirable, in a simple image such as 
that shown in Figure 7, to operate a tow-frequency band 
form the viewpoint of prevention of picture quality deg- 
radation. Nevertheless, even if a frequency band which 
is too high were operated, the copy of the base region 
would be easily lost if a third party processes data. 
Therefore, in such an image it is desirable to arrange 
copies at an intermediate frequency band. 

A description will be made of a system which real- 
ises the aforementioned data hiding method. Figure 10 
is a block diagram of a system which embeds message 
information into media information. On a storage device 
11 , media information (M) and message information (m) 
are stored. 

Frequency transform means 12 obtains the fre- 
quency spectra of media information (M) and message 
information (m). The obtained frequency spectra are 
stored on a predetermined address region of main mem- 
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ory 13. Base-region extraction means 14 extracts base 
region B from the frequency spectrum of the message 
information (m) stored on the main memory 13, and the 
base region B contains feature frequency components 
which represent the feature of the message information 
in real space. Intermediate-information generation 
means 15 generates a plurality of copies of the base 
region B obtained by the base-region extraction means 

14 and also generates a frequency spectrum as an in- 
termediate information by dispersing and arranging the 
respective copies of the base region B in frequency re- 
gion. The intermediate-information generation means 

15 further outputs a key K which is information on the 
arranged positions of the copies of the base region B. 
Algorithm means 16 hides message information (m) into 
media information (M) by operating the frequency spec- 
trum of the media information stored on the main mem- 
ory 13, based on the frequency spectrum as an inter- 
mediate information. The inverse frequency transform 
of the operated frequency spectrum is performed by in- 
verse frequency transform means 1 7, and media infor- 
mation M' embedded with message information is out- 
put 

A description will next be made of a system which 
realises the aforementioned data extracting method. 
Figure 11 is a block diagram of a system which extracts 
message information from the media information with 
the message information hidden multiply in frequency 
space. On a storage device 21 , media information (M 1 ) 
is stored. First generation means obtains a frequency 
spectrum as first intermediate information by operating 
the media information (M 1 ). 

The first intermediate information has a plurality of 
base regions B including a feature frequency compo- 
nent representative of the feature of message informa- 
tion in real space and is stored on a predetermined ad- 
dress region of main memory 1 3. Specification means 
24 specifies at least one base region B from frequency 
spectrum of the first intermediate information stored on 
the main memory 23, based on an archived key K. Sec- 
ond generation means 25 generates a frequency spec- 
trum as second intermediate information by arranging 
the feature frequency component of the base region B 
at a predetermined position on frequency space. Then, 
extraction means 26 extracts and outputs the message 
information (m) hidden in the media information (M') by 
performing the inverse frequency transform of the fre- 
quency spectrum of the second intermediate informa- 
tion. 

The program which realises the aforementioned da- 
ta embedding method is generally stored on a storage 
medium. Here, the storage medium is, for example, a 
floppy disk, a hard disk, CD-ROM, MO, DVD, a semi- 
conductor memory, and so on. This program has the fol- 
lowing steps of: 

(a) obtaining a frequency spectrum which contains 
a plurality of frequency components with respect to 



said message information; 

(b) extracting a base region from said frequency 
spectrum of the message information, the base re- 

s gion being defined as a region which contains fea- 
ture frequency components that represent features 
of said message information in real space; 

(c) generating a plurality of copies of said base re- 
10 gion and generating a frequency spectrum obtained 

by dispersedly arranging the respective copies in 
frequency space, as intermediate information; and 

(d) hiding said message information into said media 
'5 information by operating said frequency spectrum 

of the message information, based on said interme- 
diate information. 

In addition, the program which realises the afore- 
20 mentioned data extraction method has the following 
steps of: 

(a) obtaining, as first intermediate information, a fre- 
quency spectrum which has a plurality of base re- 

25 gions by operating said media information with said 
message information hidden therein, each of the 
base regions containing feature frequency compo- 
nents which represent features of said message in- 
formation in real space; 

30 

(b) specifying at least one of the base regions from 
said frequency spectrum obtained as said first in- 
termediate information; 

3S (c) generating a frequency spectrum obtained by ar- 
ranging said feature frequency components of said 
base region at predetermined positions on frequen- 
cy space, as second intermediate information; and 

40 (d) extracting said message information hidden into 
said media information by performing inverse fre- 
quency transform of said second intermediate infor- 
mation. 

4 5 The data hiding technique employing the present 
invention in the aforementioned way has high resistance 
to processing based on a frequency fitter and to ana- 
logue copying, and consequently, the data hiding tech- 
nique can effectively maintain and extract hidden mes- 

50 sage information even when such processing is per- 
formed. 

The data hiding method of the present invention 
may have any of the following features: 

55 a) The base region contains a plurality of frequency 
components which represent external features of 
said message information in real space; 
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and multiply arranged; 

m) In specifying at least one of the base regions (B) 
from said frequency spectrum obtained as said first 
s intermediate information (f 3 ) said base region can 
be specified by giving positional information which 
specifies a position of said base region in frequency 
space. 

10 n) In generating a frequency spectrum obtained by 
arranging said feature frequency components of 
said base region (B) at predetermined positions on 
frequency space, as second intermediate informa- 
tion (54); and base region can be arranged by giving 

is an arranging rule which specifies a position of said 
base region in frequency space. 

o) Among the frequency components of said fre- 
quency spectrum as said second intermediate in- 
20 formation, frequency components other than posi- 
tions at which said base regions are arranged are 
zero. 



25 Claims 

1 . A data hiding method for embedding message in- 
formation (m) in media information (M) character- 
ised by the steps of: 

30 

(a) obtaining a frequency spectrum (1 V f 2 ) 
which contains a plurality of frequency compo- 
nents with respect to said message information 
(m); 

35 

(b) extracting a base region (B) from said fre- 
quency spectrum of the message information, 
the base region (B) being defined as a region 
which contains feature frequency components 
that represent features of said message infor- 
mation in real space; 

(c) generating a plurality of copies (r|) of said 
base region and generating a frequency spee- 
ds trum (f 3 ) obtained by dispersedly arranging the 

respective copies in frequency space, as inter- 
mediate information; and 

(d) hiding said message information (m) in said 
so media information (M) either by operating said 

frequency spectrum (f 1( f 2 ) of the message in- 
formation, (m) based on said intermediate in- 
formation or by executing an arithmetic based 
on both said intermediate information (f 3 ) and 
55 said media information (M). 



b) The media information is image information and 
said base region is constituted by a plurality of low- 
frequency components which represent contour 
features of an image in real space; 

c) A plurality of base regions are extracted from said 
frequency spectrum of the message information, 
the base regions being defined as regions which 
contain feature frequency components that repre- 
sent features of said message information in real 
space. 

d) The copies each have the same frequency com- 
ponent as the feature frequency component of the 
base region; 

e) Among the frequency components of said fre- 
quency spectrum as said intermediate information, 
frequency components other than positions at 
which said copies are arranged are zero; 

f ) The step of generating a plurality of copies (n) of 
said base region and generating a frequency spec- 
trum (f 3 ) obtained by dispersedly arranging the re- 
spective copies in frequency space, as intermediate 
information is executed for each of the base regions 
extracted by step c) above; 

g) The step of hiding said message information in 
said media information wherein the arithmetic is a 
binomial arithmetic; 

h) The arithmetic in g) above is addition; 

i) The data hiding method may, further comprise a 
step of performing inverse frequency transform of a 
result obtained in g) above when the arithmetic is 
executed in frequency space. 

The data extraction method of the present invention 
may have any of the following features: 

j) In the step of obtaining a frequency spectrum (f v 
f 2 ) which has a plurality of frequency components 
with respect to first intermediate information (f 3 ) ob- 
tained by an arithmetic based on both said media 
information (m) with said message information hid- 
den therein and said media information (m) with no 
message information hidden therein the arithmetic 
is a binomial arithmetic; 

k) In j) above the arithmetic is the difference be- 
tween said media information with said message in- 
formation hidden therein and said media informa- 
tion with no message information hidden therein; 

I) On said frequency spectrum of the first interme- 
diate information, said base regions are dispersed 



2. A data extraction method for extracting message in- 
formation (m) from media information (M) with said 
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message information hidden in frequency space, 
characterised by the steps of: 

(a) obtaining, as first intermediate information 
(f3), a frequency spectrum which has a plurality £ 
of base regions (B) by operating said media in- 
formation (M) with said message information 
(m) hidden therein, each of the base regions 
(B) containing feature frequency components 
which represent features of said message in- 10 
formation (m) in real space; 

(b) specifying at least one of the base regions 
(B) from said frequency spectrum obtained as 
said first intermediate information (f 3 ); '5 

(c) generating a frequency spectrum obtained 
by arranging said feature frequency compo- 
nents of said base region (B) at predetermined 
positions on frequency space, as second inter- 20 
mediate information (54); and 

(d) extracting said message information (m) 
hidden in said media information (M) by per- 
forming inverse frequency transform of said 2s 
second intermediate information (54). 

3. A data extraction method for extracting message in- 
formation (m) from media information (M) with said 
message information hidden in frequency space, 30 
characterised by the steps of: 

(a) obtaining a frequency spectrum (f v f 2 ) 
which has a plurality of frequency components 
with respect to first intermediate information (f 3 ) 3$ 
obtained by an arithmetic based on both said 
media information (m) with said message infor- 
mation hidden therein and said media informa- 
tion (m) with no message information hidden 
therein; 40 

(b) specifying at least one of the base regions 
(B) from said frequency spectrum of the first in- 
termediate information (f 3 ) each of the base re- 
gions containing feature frequency compo- 
nents which represent features of said mes- 
sage information in real space; 

(c) generating a frequency spectrum obtained 

by arranging said feature frequency compo- &> 
nents of said base region (B) at predetermined 
positions on frequency space, as second inter- 
mediate (54) information; and 

(d) extracting said message information (m) ss 
hidden into said media information (M) by per- 
forming inverse frequency transform of said 
second intermediate information (54) 



4. A system for embedding message information (m) 
in media information (M), characterised by the steps 
of: 

(a) means (12) for obtaining a frequency spec- 
trum (f v f 2 ) which contains a plurality of fre- 
quency components with respect to said mes- 
sage information (m); 

(b) an extractor (14) for extracting a base region 
(B) from said frequency spectrum of the mes- 
sage information (m), the base region (B) being 
defined as a region which contains feature fre- 
quency components that represent features of 
said message information (m) in real space; 

(c) a generator (15) for generating a plurality of 
copies of said base region (B) and generating 
a frequency spectrum obtained by dispersedly 
arranging the respective copies in frequency 
space, as intermediate information (f 3 ); and 

(d) means (16) for hiding said message infor- 
mation (m) in said media information (m) by op- 
erating said frequency spectrum of the mes- 
sage information (m), based on said intermedi- 
ate information (f 3 ). 

5. A system for extracting message information (m) 
from media information (M) with said message in- 
formation hidden in frequency space, characterised 
by the steps of: 

(a) means (12) for obtaining, as first intermedi- 
ate information (f3), a frequency spectrum (f v 
f 2 ) which has a plurality of base regions (B) by 
operating said media information (M) with said 
message information (m) hidden therein, each 
of the base regions (B) containing feature fre- 
quency components which represent features 
of said message information (m) in real space; 

(b) a specifier (24) for specifying at least one of 
the base regions (B) from said frequency spec- 
trum obtained as said first intermediate infor- 
mation (f 3 ); 

(c) a generator (25) for generating a frequency 
spectrum obtained by arranging said feature 
frequency components of said base region (B) 
at predetermined positions on frequency 
space, as second intermediate information 
(54); and 

(d) an extractor (26) for extracting said mes- 
sage information (m) hidden in said media in- 
formation (M) by performing inverse frequency 
transform of said second intermediate informa- 
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tion (54). 



inverse frequency transform of said second in- 
termediate information (54). 



6. A program storage medium (11) for executing a 
process of hiding message information (m) in media 
information (M) by a computer, the medium charac- s 
terised by the steps of: 

(a) obtaining a frequency spectrum which con- 
tains a plurality of frequency components (i v 

y with respect to said message information io 
(m); 

(b) extracting a base region (B) from said fre- 
quency spectrum of the message information 
(m), the base region (B) being defined as a re- '5 
gion which contains feature frequency compo- 
nents that represent features of said message 
information (m) in real space; 

(c) generating a plurality of copies of said base 20 
regbn (B) and generating a frequency spec- 
trum obtained by dispersedly arranging the re- 
spective copies in frequency space, as interme- 
diate information (f 3 ); and 



7. A program storage medium (11) for executing a 
process of extracting message information (m) from 
media information (M) with said message informa- 
tion hidden in frequency space by a computer, the 35 
medium characterised by the steps of: 

(a) obtaining, as first intermediate information 
(f 3 ), a frequency spectrum which has a plurality 

of base regions (B) by operating said media in- *o 
formation (M) with said message information 
(m) hidden therein, each of the base regions 
(B) containing feature frequency components 
which represent features of said message in- 
formation (m) in real space; 45 

(b) specifying at least one of the base regions 
(B) from said frequency spectrum obtained as 
said first intermediate information (f 3 ); 



25 




30 



so 



(c) generating a frequency spectrum obtained 
by arranging said feature frequency compo- 
nents of said base region (B) at predetermined 
positions on frequency space, as second inter- 
mediate information (54); and 



55 



(d) extracting said message information hidden 
(m) in said media information (M) by performing 
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